An experimental evaluation of de-identification tools for electronic health records

نویسندگان

  • Jie Qian
  • Nafees Qamar
چکیده

The robust development of Electronic Health Records (EHRs) causes a significant growth in sharing EHRs for clinical research. However, such a sharing makes it difficult to protect patients’ privacy. A number of automated de-identification tools have been developed to reduce the re-identification risk of published data, while preserving its statistical meanings. In this paper, we focus on the experimental evaluation of existing automated de-identification tools, as applied to our EHR database, to assess which tool performs better with each quasi-identifiers defined in our paper. Performance of each tool is analyzed wrt. two aspects: individual disclosure risk and information loss. Through this experiment, the generalization method has better performance on reducing risk and lower degree of information loss than suppression, which validates it as more appropriate de-identification technique for EHR databases.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Adoption of Electronic Personal Health Records in Canada: Perceptions of Stakeholders

Background Healthcare stakeholders have a great interest in the adoption and use of electronic personal health records (ePHRs) because of the potential benefits associated with them. Little is known, however, about the level of adoption of ePHRs in Canada and there is limited evidence concerning their benefits and implications for the healthcare system. This study aimed to describe the current ...

متن کامل

Evaluation Measures for Detection of Personal Health Information

Texts containing personal health information reveal enough data for a third party to be able to identify an individual and his health condition. Detection of personal health information in electronic health records is an essential part of record deidentification. Performance evaluation in use today focuses on method’s ability to identify whether a word reveals personal health information or not...

متن کامل

De-identification of health records using Anonym: Effectiveness and robustness across datasets

OBJECTIVE Evaluate the effectiveness and robustness of Anonym, a tool for de-identifying free-text health records based on conditional random fields classifiers informed by linguistic and lexical features, as well as features extracted by pattern matching techniques. De-identification of personal health information in electronic health records is essential for the sharing and secondary usage of...

متن کامل

Identification of Effective Factors related to Implementation of Electronic Health Records in Imam Khomeini Hospital, Tehran

Background: With the advancement of science and emergence of new technologies for solving human health and medical problems, one of the most important applications of technology in the field of health care is creation of electronic health records. The purpose of this study was to determine the effective internal and external factors related to successful implementation of the electronic health ...

متن کامل

Access and Representation

Health related research is an interdisciplinary, broad and growing research area. With the growth of digitalised systems that simplify and make work processes more efficient in many companies and organisations, the amount of available data is now immense. The information contained in health related digital data sets could be used for further research and also, in the long run, for improving hea...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1211.3836  شماره 

صفحات  -

تاریخ انتشار 2012